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1. REAL PARTY IN INTEREST 

Vermillion, Inc. (previously known as Ciphergen Biosystems, Inc.) and Eastern 
Virginia Medical School are assignees of the patent application from the inventors and are the 
Real Parties in Interest. 

2. RELATED APPEALS AND INTERFERENCES 

There are no related appeals, interferences, or judicial proceedings at this time. 

3. STATUS OF CLAIMS 

Claims 1, 8, 12, 20 and 84-94 are pending. Claims 2-7, 9-1 1, 13-19, and 21-83 
are canceled. Claims 1, 8, 12, 20 and 84-94 are being appealed. 

4. STATUS OF AMENDMENTS 

There have been no amendments to the pending claims after the final Office 

Action. 

5. SUMMARY OF CLAIMED SUBJECT MATTER 

Claim 1 is the only independent claim on appeal. The remaining claims are 
independent and are not being separately argued. The claimed subject matter is directed to the 
use of mass spectroscopy to distinguish between patients with prostate cancer and benign 
prostate hyperplagia [BPH]. 

It was discovered that persons having prostate cancer have elevated protease 
activity that leads to a marked shift in the proportion of small peptides in the cancer samples 
compared to those from patients with BPH (Figure 1). Protease activity refers to enzymes that 
degrade proteins. Using mass spectroscopy, data profiles are generated depicting a greater 
abundance of smaller proteins in samples from cancer patients than from persons suffering from 
BPH. The claims are directed to differentially diagnosing prostate cancer from BPH by 
observing molecular weight peak shifts in the MS profiles. An advantage of the claimed method 
is the elimination of the need to capture and identify specific proteins. 
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Support for claim 1 arises from the following parts of the specification (as 
originally filed) 

The preamble reciting: A method for diagnosing prostate cancer versus benign prostate 
hyperplasia, the method comprising:, finds support in the original claim 1 and in the 
specification at page 2, lines 18-24, page 26, line 8, page 27, and lines 22-23. 

The step of: obtaining from a subject a sample containing a plurality of 
prostate related protein markers having apparent molecular weights below 10,000 Da 
wherein the sample is selected from the group consisting of prostate tissue, blood, serum, 
semen, seminal fluid or seminal plasma;, finds support on page 2, lines 1 8-29, page 3 at lines 
14-16 (reciting subjects); page 26 lines 1-22 describing obtaining prostrate samples; at page 2, 
line 34 specifying 10 kDa as a preferred size; and original claim 8 reciting specific samples. 

The step of: determining by mass spectroscopy a test amount of the plurality 
of protein markers in the sample, the protein markers having an apparent molecular 
weight of less than 10,000 Da, finds support throughout the specification and expressly on page 
3, line 25 reciting MS and in the examples as depicted in Figure 3. 

The step of: comparing the test amount of the plurality of protein markers 
having apparent molecular weight of less than 10,000 Da with an amount of a plurality of 
protein markers having an apparent molecular weight of less than 10,000 Da from a 
control sample where the control sample originates from benign prostate hyperplasia 
patients, finds support through out the specification where determination of difference are set 
forth by comparing and expressly at pages 26-28 and at page 27, lines 14, 20 and 3 1 where 
comparison with control language is used. 

The last step of: determining whether the test amount is a diagnostic amount 
consistent with a diagnosis of prostate cancer versus benign prostate hyperplasia, finds 
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support in the original claim 1 and in the specification at page 2, lines 18-24, page 26, line 8, 
page 27, and lines 22-23. 

6. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

There is single ground of rejection presented for review. In the Final Office 
Action dated June 5, 2007, claims 1, 8, 12, 20 and 84-94 are rejected under 35 U.S.C. § 1 12 for 
alleged lack of enablement. The pending claims are rejected as non-enabled because the claims 
are not limited to the exemplified sample source, i.e., seminal plasma, and because the claims do 
not recite the nine specific protein markers that make up the majority of the mass spectroscopy 
profile of the examples provided in the specification. 

7. ARGUMENT 

A. Rejection and Examiner's Arguments 

The Examiner's position is that the claims are enabled if the claims recite 
"wherein a sample from seminal plasma having a protein characterized by molecular weight of 
2776 Da, 4423 Da, 4480 Da, 5753 Da, 6098 Da, 6270 Da, 7843 Da, 8030 Da, 8240 Da, and 
8714 Da." According to the Examiner, the pending claims are not enabled when the sample is 
from blood, prostate tissue, serum, semen and seminal fluid. Furthermore, the Examiner states 
that it would require undue experimentation to practice the invention by observing a shift in the 
peaks representing undefined proteins having a molecular weight under 10,000 Da. 

B. Legal Standards for Enablement 

It is well-settled in the biotechnology art that In re Wands, 858 F.2d 731, 8 
USPQ2d 1400 (Fed. Cir. 1988) sets the standard for enablement. As stated in Wands, 
"enablement is not precluded by the necessity for some experimentation, such as routine 
screening." In re Wands, 858 F.2d at 737, 8 USPQ2d at 1404 (Fed. Cir. 1988). The fact that 
experimentation may be complex does not render it undue. 
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As set forth by the Federal Circuit in In re Wands, 8 USPQ2d 1400, 1404 (Fed. 
Cir. 1988), multiple factors should be considered when determining whether any necessary 
experimentation is undue. These factors include: 

(a) the breadth of the claims; 

(b) the nature of the invention; 

(c) the state of the prior art; 

(d) the level of one of ordinary skill; 

(e) the level of predictability in the art; 

(f) the amount of direction provided by the inventor; 

(g) the existence of working examples; and 

(h) the quantity of experimentation needed to make or use the invention based on 
the content of the disclosure. 

C. Claims 1 , 8, 1 2, 20 and 84-94 are enabled. 

It does not require very much technical description to enable this invention. The 
claims only require that you produce an MS profile of the peaks representing proteins of less 
than 10,000 Da from the variously described samples. More specifically, you look for a shift in 
the proteins below a molecular weight of 10,000 Da towards lower molecular weight proteins. 
Ostensibly, persons with prostate cancer have an abundance of protease activity and this results 
in the ability to distinguish between patients with benign prostate hyperplagia [BPH] and 
prostate cancer. BPH is an enlargement of the prostate commonly seen in older men. 

The Examiner's two concerns are the failure of the appellants to provide results 
for samples other than seminal plasma, and failure to recite nine specific markers in the claims. 
With regard to the first concern, post-filing publications have been submitted along with a Rule 
132 Declaration by Dr. Tai-Tung Yip describing the use of both blood and prostate samples to 
distinguish between BPH and prostate cancer (see Evidence Appendix-page 15). The Examiner 
is silent as to why this evidence does not fully address his concern. 

Secondly, the recitation of specific markers in the claims will deny the appellants 
the benefit of their true contribution. The invention is not the discovery of specific markers for 
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detecting prostate cancer. Rather, the invention is the discovery that MS can be used to cost 
effectively distinguish between BPH and prostate cancer by detecting elevated levels of 
generalized protein degradation. This is in marked contrast to marker-specific assays of the prior 
art. The beauty of MS profiling is that the device generates a smooth data curve where shifts in 
the position of peaks are readily observable. With MS, the doctors don't have to determine 
which proteins make up the peaks. Figure 1 of the subject application provides a graphic 
example of this shifting in peaks. 

We will now turn our attention to the Examiner's concerns in the context of the 
Wands' Factors. It will be explained that the Examiner's Wands' concerns are fully addressed by 
Dr. Yip's Rule 132 Declaration and attached evidence, or they are based upon irrelevant facts 
that even if accurately stated do not have an impact on the legal question of whether the claims 
are enabled. 

The nature of the invention 

According to the Examiner, the invention is classified within the unpredictable art 
of chemistry and biology. Beyond citing to Mycogen Plant Sci, Inc., v. Monsanto Co., 243 F3d. 
1316; 58 U.S.P.Q.2D (BNA) 1030 (Fed Cir. 2001), the Examiner says nothing more. In fact, 
the Federal Circuit in Mycogen did not make a flat out statement that biology and chemistry are 
unpredictable arts with regard to enablement. The Federal Circuit was explaining that 
simultaneous conception and reduction to practice occurs in unpredictable arts such as chemistry 
and biology. 

In the context of this invention, it is true that the relevant scientific findings were 
not predictable from the prior art and thus the invention is non-obvious. But once discovered, 
the use of MS to distinguish between prostate cancer and BPH becomes routine, predictable and 
reproducible. Wands requires nothing more and in the absence of anything more specific by the 
Examiner, his bald statements classifying the invention as within an unpredictable art fails to 
support the rejection of the pending claims as non-enabled. 
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Level of skill in the art 

Appellants agree that the level of skill is high. Although the Examiner refers to 
Ph.D. or M.D. skills, mass spectroscopy is a routine procedure commonly conducted by trained 
technicians. 

The breadth of the claims 

In this section, the Examiner merely restates the pending independent claim. This 
recitation adds nothing to the calculus of whether the claim scope is overly broad. Beyond 
describing the claim as "broad," the Examiner is silent on any reason or rationale as to why one 
of skill could not practice the invention as claimed. 

In contrast, the appellants have submitted a Rule 132 Declaration of Dr. Yip 
detailing the results with different samples and using different MS probes surfaces. (See the 
Evidence Appendix-page 15). 

Guidance in the specification and working examples 

In this section the Examiner goes into the details of the depth of teaching provided 
by the specification. He explains that the sample can be an absolute amount or a relative 
amount. The protein markers were identified using different MS probe surfaces and that the 
apparent molecular weights of the proteins being evaluated differ according to the probe being 
used. 

The sole concern raised here was that only one test sample source, seminal 
plasma, was exemplified. This concern was addressed by appellants in a submission of a 
declaration and post filing publications describing similar results using blood and prostate tissue 
samples. 

The Examiner is silent as to why this submission of evidence did not dispel his 
concerns that it would require undue experimentation to practice the invention using the different 
sample sources recited in the claims. The MPEP §2164.04 clearly states that the initial burden is 
on the Examiner to explain why he believes that the other sample sources are not likely to work. 
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There only needs to be a "reasonable correlation" between the claim scope and the teachings in 
the specification. 

Despite the fact that the Examiner did not articulate any valid reasons beyond a 
vague statement of unpredictability, appellants supplied rebuttal evidence of post-filing success 
with other samples. The use of post-filing date evidence is expressly approved by the MPEP 
§2164.05 where the use of post- filing evidence is stated to be acceptable so long as the evidence 
is used to establish that disclosure of the method was sufficient when the application was filed. 
Sufficiency of disclosure is not an issue here. The suggestion of the source of the sample is a 
complete teaching. It should be apparent to the Board that the rationale for the §1 12 rejection is 
more of a utility/enablement rejection than a classic enablement rejection. 

Quantity of experimentation 

Here the Examiner makes an unsupported statement that the area of proteonomics 
is "extremely large." This is an irrelevant truth. The area of proteonomics is an exciting field of 
unknown potential with much need for experimentation. However, this truth is irrelevant to the 
issue before the Board. The relevant question is whether it requires undue experimentation to 
practice an invention requiring the steps of: (i) obtaining samples; and, (ii) quantifying the 
proteins in those samples having molecular weights below 10,000 Da using MS. 

Using MS to quantify the size of proteins is standard and routine MS work. 
Appellants see no objective reason why it would require undue experimentation to practice this 
invention, and the Examiner has not articulated any rationale beyond a general and unsupported 
statement regarding the unpredictable nature of proteonomics, chemistry and biology. 

The unpredictability of the art and the state of the prior art 
In this section the Examiner sets forth a number of irrelevant truths and then 
argues that the pending claims are not enabled. More specifically, the Examiner relies on 
Diamandis (2004), Diamandis (2005), and Grizzel (2005). Diamandis (2004/2005) describes 
potential problems in the analysis of serum protenomics patterns for detection of cancer. The 
problems include: 
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• how the markers are released into the host serum; 

• their relative abundance; and 

• the dynamic relationship between the host, the samples, the MS apparatus, 
and the bioinformatics used to provide the analysis. 

In the latter category, Diamandis states that markers may vary over time and progression of 
disease, that different research groups may provide different results; that sample storage and 
handling will effect the MS profiles of markers. Grizzle is cited for mentioning the same 
concerns and mentioning ethnicity, experimental design, spectral analysis as additional 
parameters that need to be evaluated. 

Appellants argue that these multiple concerns involve parameters that are 
commonly and routinely optimized for any diagnostic assay. None of these concerns, 
individually nor in combination, constitute undue experimentation with regard to whether one 
can distinguish between BPH and prostate cancer using MS profiles. Appellants have 
demonstrated that this can be done using 3 different probes and 3 different sample sources. 

The concerns raised by Diamandis and Grizzel are important concerns; but, they 
are not relevant inquiries for purposes of enablement under §112. Rather, they are relevant for 
creating a commercially acceptable kit having the specificity (positive predictive value) and 
sensitivity (negative predictive value) needed to pass muster with the FDA, not the USPTO {In 
re Watson, 517 F.2d 465 at 476; 186 U.S.P.Q. 11 (CCPA 1975)). 

Yes, the FDA will require the commercial producer of the assays to standardize 
the apparatus, the sample handling procedure and bioinformatics used to analyze the data before 
product release. The FDA will also ask the producer to test the assay in a variety of patients 
including perhaps patients of different ethnicities. But for patent purposes, we need only teach 
how to make and use the invention where the teachings are reasonably correlated with the scope 
of the claim to satisfy the enablement requirement. Appellants submit that the reasonably 
correlated standard has been fully met when the invention has been reproducibly demonstrated 
using three different samples and three different MS probes. 
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Examiner's conclusion. 

The Examiner provides a conclusion section on pages 9-10 of the Final Office 
Action. In the conclusion, the Examiner raises two additional points that bear comment by 
appellants. 

The claims must recite the nine masses because they make up the MS profile 
The Examiner's demand that the claims include a recitation of the nine peptide 
masses is at the heart of the issue on appeal. The Examiner argues that the appellants in the 
specification and in Dr Yip's Declaration (attached) have stated that the MS profiles are made up 
of 9 masses that are reliably and reproducibly detected. He then goes on to state that the origin 
of the sample, the probes used to capture the proteins for MS analysis and the post-filing date 
publications, all suggest that the masses are not reproducibly detected. 

The Examiner has correctly set forth the situation. The detected masses differ 
according to the various parameters being applied. What doesn't change is the shifting of the 
masses below 10,000 Da to lower molecular weights because of increased protease activity in 
prostate cancer patients compared to those suffering from BPH. This shift in low molecular 
weight proteins is appellants' invention. And the fact that changes in the specific parameters 
relating to sample preparation and MS analysis might result in different absolute masses does not 
dictate that the claims require undue experimentation. It does not require undue experimentation 
to standardize the sampling and MS parameters and detect a shift in mass size. 

The claims embrace samples from healthy people 

Finally, the Examiner notes for the first time that the claims are directed to 
distinguishing between BPH and prostate cancer in healthy people. Actually, the claims recite 
distinguishing between BPH and prostate cancer in subjects. Appellants respond by urging that 
common sense be applied here. The issue of whether the claims include distinguishing between 
prostate cancer and BPH in healthy people (both male and female) is not an enablement issue. 
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Virtually all patent claims embrace obvious non-working embodiments. For 
example, you cannot make a patented gas engine out of paper; but, a typical claim to novel 
engine parts does not recite the physical characteristics of the engine block. So long as the 
claimed method is taught in sufficient detail to distinguish between BPH and cancer in male 
patients exhibiting abnormal prostates without undue experimentation, the fact that the claimed 
assays could be conducted on healthy males or even females does not render the claim non- 
enabled. 

The question of how precisely to draft the preamble of claim 1 regarding the test 
subjects is an issue of drafting style. It does not give rise to a question of enablement. 
Appellants would be pleased to address such amendments after the Board has indicated that the 
claims are otherwise patentable without further limitation to seminal plasma nor to the nine 
markers of the examples. 

8. CONCLUSION 

For these reasons, it is respectfully submitted that the rejection should be 

reversed. 

Respectfully submitted, 

Kenneth A. Weber 
Reg. No. 31,677 

TOWNSEND and TOWNS END and CREW LLP 

Two Embarcadero Center, Eighth Floor 

San Francisco, California 94111-3834 

Tel: 415-576-0200 

Fax:415-576-0300 

61190136 v1 
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9. CLAIMS APPENDIX 

1 . (previously presented) A method for diagnosing prostate cancer versus benign prostate 
hyperplasia, the method comprising: 

i. obtaining from a subject a sample containing a plurality of prostrate related protein 
markers having apparent molecular weights below 10,000 Da wherein the sample is selected 
from the group consisting of prostate tissue, blood, serum, semen, seminal fluid or seminal 
plasma; 

ii. determining by mass spectroscopy a representative pattern of the quantity of a plurality 
of protein markers in the sample, the protein markers having an apparent molecular weight of 
less than 10,000 Da; 

iii. comparing the pattern of the plurality of protein markers having apparent molecular 
weight of less than 10,000 Da with an amount of a plurality of protein markers having an 
apparent molecular weight of less than 10,000 Da from a control sample where the control 
sample originates from benign prostate hyperplasia; 

and 

iv. determining whether the pattern of the sample is a diagnostic amount consistent with a 
diagnosis of prostate cancer versus benign prostate hyperplasia where the pattern consistent with 
a diagnosis of prostate cancer is represented by an increase in the quantity of lower molecular 
weight proteins. 

Claims 2-7 (canceled) 

8. (previously presented) The method of claim 1, wherein the seminal fluid sample is 
selected from the group consisting of semen and seminal plasma. 

Claims 9-11 (canceled) 
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12. (previously presented) The method of claim 1, the method further comprising: 

(a) generating data on the sample with the mass spectrometer indicating intensity of 
signal for mass/charge ratios; 

(b) transforming the data into computer-readable form; and 

(c) operating a computer to execute an algorithm, wherein the algorithm determines 
closeness-of-fit between the computer-readable data and data indicating a diagnosis of prostate 
cancer or a negative diagnosis. 

Claims 13-19 (canceled) 

20. (previously presented) The method of claim 1, wherein the sample is seminal plasma. 
Claims 21-83 (canceled) 

84. (previously presented) The method of claim 1 where the protein markers are 
adsorbed onto a probe comprising an adsorbent of a hydrophilic polymer. 

85. (previously presented) The method of claim 1 where the protein markers are 
adsorbed onto a probe comprising a metal binding group. 

86. (previously presented) The method of claim 84 where the adsorbent comprises a 
hydrophobic group. 

87. (previously presented) The method of claim 84 where the adsorbent comprises a 
cationic group. 

88. (previously presented) The method of claim 84 where the adsorbent comprises a 
metal ion chelating group. 

89. (previously presented) The method of claim 20, the method further comprising: 
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(a) generating data on the sample with the mass spectrometer indicating intensity of 
signal for mass/charge ratios; 

(b) transforming the data into computer-readable form; and 

(c) operating a computer to execute an algorithm, wherein the algorithm determines 
closeness-of-fit between the computer-readable data and data indicating a diagnosis of prostate 
cancer or a negative diagnosis. 

90. (previously presented) The method of claim 20 where the protein markers are 
adsorbed onto a probe comprising an adsorbent of a hydrophilic polymer. 

91. (previously presented) The method of claim 20 where the protein markers are 
adsorbed onto a probe comprising a metal binding group. 

92. (previously presented) The method of claim 90 where the adsorbent comprises a 
hydrophobic group. 

93. (previously presented) The method of claim 90 where the adsorbent comprises a 
cationic group. 

94. (previously presented) The method of claim 90 where the adsorbent comprises a 
metal ion chelating group. 
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10. EVIDENCE APPENDIX 

Rule 132 Declaration by Dr. Tai-Tung Yip with Exhibits 1-3 filed on June 26, 
2006, and acknowledged by the Examiner in the Office Action mailed on September 1 , 2006 
(attached to Appellants' Brief hereto after page 16). 



11. RELATED PROCEEDINGS APPENDIX 



None. 
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By /Jo Ann Honcik Dallara/ 
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Alexandria, VA 22313-1450 

I, Dr. Tai-Tung Yip, being duly warned that willful false statements and the like are 
punishable by fine or imprisonment or both, under 18 U.S.C. § 1001, and may jeopardize the 
validity of the patent application or any patent issuing thereon, state and declare as follows: 

1 . All statements herein made of my own knowledge are true and statements made 
on information or belief are believed to be true. The Exhibits (1 and 2) attached hereto are 
incorporated herein by reference. 
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2. I received a Ph.D. in Biochemistry from the Chinese University of Hong 
Kong, Faculty of Science in 1985. 

3 . I am presently employed by Ciphergen as a Senior Research Fellow and I am 
primarily responsible for clinical and basic research in SELDI-TOF-MS. 

4. I have read and am familiar with the contents of the subject patent application. I 
understand that the Examiner has a rejection of the pending claims based on enablement. The 
Examiner is concerned that practice of the invention, as previously claimed, would require undue 
experimentation to fully practice. More specifically, the Examiner was concerned about the 
reproducibility of our results. The primary concerns were over whether the surface chemistry of 
the MS probe would affect our results, whether the work was reproducible across a large patient 
population and whether we could distinguish between benign prostate hyperplasia and prostate 
cancer from sample sources beyond prostate serum (seminal fluid). Secondary concerns were 
whether sample handling, statistical analyses and patient conditions would affect our results. 

Below I address the primary concerns individually. The secondary concerns are 
addressed by a single response. 

5. IS THE PHENOMENON OF AN ABUNDANCE OF LOWER WEIGHT PROTEINS 
IN PROSTATE CANCER SAMPLES DETECTABLE WITH OTHER MS PROBE 
SURFACES? 

The answer is yes. We looked at three different surfaces to determine the answer to this 
very question. Attached to this declaration as Exhibit 1 are copies of MS profiles done using two 
other chip surfaces. Figure 6 of our patent application illustrates the results with an SCX1 
surface which is described in our specification at page 33, lines 26-3 1 and uses a sulfonated 
polystyrene as a capture agent. Exhibit 1 describes the similar serum samples studied on a 
Ciphergen IMAC Ni(II) chip and our H4 chip. The IMAC chip is described in detail on page 
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31, lines 16-23. Our H4 is a good general protein capture surface that mimics reversed phase 
chromatography with C16 functionality. H4 is described in our specification at page 32, lines 
15-32. 



From Exhibit one, it should be clear that the generation of low molecular weight peptide 
products is observed in our mass spectroscopic instruments using three different capture surfaces. 
Obviously, the surface chemistry needs to have the capacity to capture an appropriate range of 
proteins. 



6. IS THE PHENOMENON OF AN ABUNDANCE OF LOWER WEIGHT PROTEINS 
IN PROSTATE CANCER SAMPLES DETECTABLE IN OTHER PATIENTS? 

The answer is yes. I have attached two papers reporting on work by my co-inventors 

using Ciphergen mass spectroscopy equipment that looked at this very question. The Adam et al. 

study (Exhibit 2) used samples from hundreds of patient serum samples including control 

patients, patients with BPH and those with two types of prostate cancer. While the work reported 

on MS protein fingerprinting coupled with a sophisticated pattern matching algorithm, the results 

accurately reflect our earlier work. The authors wrote on page 3609, 2nd col: 

Using a standardized test set, we demonstrate 
proof of principle that our SELDI protein profiling 
approach can accurately discriminate PCA from patients 
with BPH and men of the same age who do not have 
prostate disease. 

The authors also write on page: 

The successful use of the prostate classification 
system described herein relies entirely on the protein 
fingerprinting of the nine masses. Because these 
masses were found to be reproducibly reliably 
detected, only the mass values are required to make a 
correct classification or diagnosis. Knowing the 
identities for the purpose of differential diagnosis 
is not required. 
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Finally, evidence of the low molecular weight peptide products as prostate cancer 
markers can be clearly seen in Figure 2 (c) on page 361 1 . 

Cazares et al. (Exhibit 3) provides similar evidence of successful application of our 

invention using prostate tissues samples from 9 patients. See the results section where the 

authors (including co-inventor George Wright) wrote in the abstract on page 2541 : 

Results: Several small molecular mass peptides or 
proteins (3000-5000 Da) were found in greater 
abundance in PIN and PCA cell lysates . 

7. IS THE PHENOMENA OF AN ABUNDANCE OF LOWER WEIGHT 
PROTEINS IN PROSTATE CANCER PATIENTS DETECTABLE IN SAMPLES OTHER 
THAN SERUM? 

Exhibits 2 and 3 provide evidence that the invention is applicable to samples other than 
from seminal fluid. Exhibit 2 shows data from blood serum and Exhibit 3 provides data from 
prostate tissue biopsies. 

8. THE REMAINING SECONDARY CONCERNS ARE ISSUES THAT ARE 
ROUTINELY ADDRESSED BY COMPETENT LABORATORY TECHNICIANS AND DO 
NOT INVOLVE UNDUE EXPERIMENTATION. 

The Examiner takes note of several papers raising concerns about the use of SELI-TOF- 
MS for diagnostic purposes. These include Grizzle et al. which includes co-inventor George 
Wright's colleague, John Semmes as co-author and by Diamandis. None of the papers 
challenges the use of MS to accurately detect proteins and for the results to reflect differences in 
disease states. Grizzle mentions issues with instrumentation drift that might make routine 
clinical diagnosis more complicated than if measured by individual peaks, a problem this 
invention avoids. Grizzle also mentions that patterns of eating, age and familial relationships 
may influence the results. In addition, site collection and storage of samples including the 
containers are important considerations. Grizzle also mentions that large and abundant proteins 
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such as albumin can also bind small proteins and that the removal of albumin might cause under 
representation of the low molecular markers. Diamandis make comments of similar character. 

In response, applicants fully acknowledge these issues as relevant ones. It is hoped that 
the patient samples described by Adams et al. and Cazares et al. addresses most of these 
concerns. However, the fact that a cheap cup might leach plastic or vinyl components and render 
the samples useless or that sample handling must be uniform and be in a stable environment to 
avoid sample degradation, are issues for any assay. But, I submit that these issues are not of the 
type that give rise to enablement concerns for patent claims. They are obvious problems to avoid 
or to solve if one simply follows the preferred methods described in the specification. Thus the 
stated problems should not be of serious concern in that they are not problems requiring undue 
experimentation to avoid. 



This Declarant has nothing further to say. 
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Serum Protein Fingerprinting Coupled with a Pattern-matching Algorithm 
Distinguishes Prostate Cancer from Benign Prostate Hyperplasia 
and Healthy Men 1 

Bao-Ling Adam, 2 Yinsheng Qu, 2 John W. Davis, Michael D. Ward, Mary Ann Clements, Lisa H. Cazares, 
O. John Semmes, Paul F. Schellhammer, Yutaka Yasui, Ziding Feng, and George L. Wright, Jr. 2,3 
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Abstract 

The prostare-speciflc antigen test has been a major factor In increasing 
awareness and better patient management of prostate cancer (PCA), but 
its lack of specificity limits its use in diagnosis and makes for poor early 
detection of PCA. The objective of our studies is to identify better bio- 
markers for early detection of PCA using protein profiling technologies 
that can simultaneously resolve and analyze multiple proteins. Evaluating 
multiple proteins will be essential to establishing signature proteomic 
patterns that distinguish cancer from noncancer as well as identify all 
genetic subtypes of the cancer and their biological activity. In this study, 
we used a protein biochip surface enhanced laser desorption/ionization 
mass spectrometry approach coupled with an artificial intelligence learn- 
ing algorithm to differentiate PCA from noncancer cohorts. Surface en- 
hanced laser desorption/ionization mass spectrometry protein profiles of 
serum from 167 PCA patients, 77 patients with benign prostate hyperpla- 
sia, and 82 age-matched unaffected healthy men were used to train and 
develop a decision tree classification algorithm that used a nine-protein 
mass pattern that correctly classified 96% of the samples. A blinded test 
set, separated from the training set by a stratified random sampling before 
the analysis, was used to determine the sensitivity and specificity of the 
classification system. A sensitivity of 83%, a specificity of 97%, and a 
positive predictive value of 96% for the study population and 91 % for the 
general population were obtained when comparing the PCA versus non- 
cancer (benign prostate hyperplasia/healthy men) groups. This high- 
throughput proteomic classification system will provide a highly accurate 
and innovative approach for the early detection/diagnosis of PCA. 

Introduction 

The number of PCA 4 cases has tripled during the past decade due 
to the widespread use of serum PSA testing and DRE (1). Although 
these efforts have allowed for increased identification of individuals 
with cancer, overall "early" detection or determination of aggressive 
cancers is needed. PSA is currently the best overall serum marker for 
PCA in clinical use. Nevertheless, the PSA test lacks specificity (2, 3), 
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limiting its use as an early detection biomarker, and its relation to 
biological activity has been questioned (4). It is important that addi- 
tional diagnostic biomarkers be identified to reduce PCA mortality. 
However, because of the robust molecular and cellular heterogeneity 
of PCA, it is likely that a combination or a panel of biomarkers will 
be required to improve the early detection of PCA. 

The study of the cell's proteome presents a new horizon for bi- 
omarker discovery. Two-dimensional PAGE has been the classical 
approach to explore the proteome for separation and detection of 
differences in protein expression (5, 6). Advances in two-dimensional 
gel electrophoresis technology coupled with robotics and software 
programs for identifying potential protein alterations have improved 
this proteomic system. Nevertheless, two-dimensional gel electro- 
phoresis is still cumbersome, labor intensive, suffers reproducibility 
problems, and is not readily transformed into a clinical assay. Ad- 
vances have also been made in mass spectrometry to achieve high- 
throughput separation and analysis of proteins (7-9). One of the recent 
advances is the ProteinChip system manufactured by Ciphergen Bio- 
systems, Inc. (Fremont, CA). This system uses SELDI time-of-flight 
mass spectrometry to detect proteins affinity-bound to a protein chip 
array (10, 11). This system is a novel, extremely sensitive, and rapid 
method to analyze complex mixtures of proteins and peptides. Initial 
studies from our laboratory established the potential of SELDI for 
discovery and profiling of prostate and bladder cancer biomarkers in 
body fluids and cell lysates (12, 13). 

The objective of this study was to determine whether SELDI 
protein profiling of serum coupled with an artificial intelligence data 
analysis algorithm could effectively differentiate PCA from BPH and 
unaffected HM. Using a standardized test set, we demonstrate proof of 
principle that our SELDI protein profiling approach can accurately 
discriminate PCA from patients with BPH and men of the same age 
who do not have prostate disease. Our results form the basis for 
initiating further evaluation and validation to assess the potential of 
this SELDI proteomic classification system for the early detection and 
diagnosis of PCA, and further study is warranted to establish profiles 
that identify the clinically important lethal cancers. 

Materials and Methods 

Serum Samples. Serum samples were obtained from the Virginia Prostate 
Center Tissue and Body Fluid Bank. Tlic serum procurement, data manage- 
ment, and blood collection protocols were approved by the Eastern Virginia 
Medical School Institutional Review Board. Blood samples from patients 
diagnosed with either PCA or BPH were procured from (he Department of 
Urology, Eastern Virginia Medical School, and the HM cohort was obtained 
from flee screening clinics open to the general public. Only pretreatment 
samples obtained at the time of diagnosis of PCA or BPH were used for this 
study. After obtaining informed consent from the patient, the sample was 
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collected into a 10-cc Scrum Separator Vacutainer Tube and ccntrifuged 30 
min later at 375,000 rpm tor 5 min. The serum was distributed into 500-^1 
aliquols and stored frozen at -80 C. A quality control sample was prepared by 
pooling un equal amount of scrum from each specimen of the age-matched MM 
group and storing l00-/il aliquols at -80°C. The quality control serum sample 
was used to determine reproducibility and as a control protein profile for each 
SELDI experiment. 

Patient and Donor Cohorts. Specimens from four groups of patients were 
used in this study: (a) 97 age-matched HM (control); (b) 92 patients with BPH; 
(c) 99 patients diagnosed with organ-confined PCA (T,/T 2 ); and (rf) 98 patients 
diagnosed with non-organ-confined PCA (T 3 /T 4 ). A donor was selected for the 
HM group if he had a normal DR.E, a PSA < 4.0 ng/ml, and no evidence of 
prostatic disease. The HM group consisted of 48 Caucasian and 48 African- 
American males ranging in age from 51-70 years (mean age, 60 years). There 
were 33 Caucasians, 2 African Americans, and 57 men of unknown race in the 
BPH patient group, ranging in age from 48-86 years (mean age, 67 years). The 
BPH patients were selected if they had PSA values between 4 and 10 ng/ml, 
low PSA velocities (i.e., PSA velocity <0.7 ng/ml/ycar), and multiple negative 
biopsies. The number of biopsies was two (73 cases), three (13 cases), and four 
(6 cases). The organ-confined PCA group (T,/T 2 ) consisted of 76 Caucasians, 
20 African Americans, I Asian, and 2 men of unknown race with ages ranging 
from 50-89 years (mean age, 71 years). For the non-organ-confined PCA 
group (T 2 /T 4 ), there were 80 Caucasians, 16 African Americans, and 2 men of 
unknown race, ranging in age from 44-87 years (mean age, 69 years). The 
range and mean PSA values for the groups were as follows: a 0.15-3.83 ng/ml 
(1.32 ng/ml) for the HM group [86 members of this group had a PSA < 2.5 
ng/ml (the latter were considered to be a low-risk group)]; (b) 0.0-10.91 ng/mi 
(4.60 ng/ml) for the BPH group; (c) 0.0-95.16 ng/ml (10.10 ng/ml) for the 
organ-confined PCA (T,/T 2 ) group; and (d) 0.0-8752 ng/ml (206.93 ng/ml) 
for the non-organ-confined PCA (T 3 /T 4 ) group. 

SELDI Protein Profiling. Various chip chemistries (hydrophobic, ionic, 
cationic, and metal binding) were initially evaluated to determine which 
affinity chemistry provided the best serum profiles in terms of number and 
resolution of proteins. The IMAC-Cu metal binding chip was observed to give 
the best results. IMAC-3 chips (Ciphergen Biosystems. Inc.) were coated with 
20 /il of 100 mM CuSCX, on each array, placed 011 a TOMY Micro Tube Mixer 
(MT-360; Tomy Seiko Co., Ltd.), and agitated for 5 min. The chips were rinsed 
10 times with DI water, and 20 11I of 100 mM sodium acetate were added to 
each array and shaken for 5 min to remove the unbound copper. The chips were 
rinsed again with DI water (10 times) and put into a bioprocessor (Ciphergen 
Biosystems, Inc.), which is a device that holds 12 chips and allows application 
of larger volumes of serum to each chip array. The bioprocessor was washed 
and shaken on a platform shaker at a speed of 250 rpm for 5 min with 200 itl 
of PBS in each well. This was repeated twice more, and each time the PBS 
buffer was discarded by inverting the bioprocessor on a paper towel. Serum 
samples for SELDI analysis were prepared by vortexing 20 /il of serum with 
30 ixl of 8 M urea/1% 3-[(3-cholamidopropyl)dimethylammonio]-l-propane- 
sulfonic acid in PBS in a 1.5-ml microfuge tube at 4°C for 10 min. One 
hundred ju.1 of 1 M urea with 0.125% 3-[(3-cholamidopropyl)dimethylammo- 
nio]-l-propanesuIfonic acid were added to the serum/urea mixture and vor- 
texed briefly. PBS was added to make a 1:5 dilution and placed on ice until 
applied to a protein chip array. Fifty ixl of the diluted serum/urea mixture were 
applied to each well, and the bioprocessor was sealed and shaken on a platform 
shaker at a speed of 250 rpm for 30 min. The serum/urea mixture was 
discarded, and the PBS washing step was repeated three limes. The chips were 
removed from the bioprocessor, washed 1 0 times with DI water, air dried, and 
stored in the dark at room temperature until subjected to SELDI analysis. 
Before SELDI analysis, 0.5 fil of a saturated solution of the EAM sinapinic 
acid in 50% (v/v) acetonitrile, 0.5% trifluoroacetic acid was applied onto each 
chip array twice, letting the array surface air dry between each sinapinic acid 
application. Chips were placed in the Protein Biological System II mass 
spectrometer reader (Ciphergen Biosystems, Inc.), and time-of-flight spectra 
were generated by averaging 192 laser shots collected in the positive mode at 
laser intensity 220, detector sensitivity 7, and a focus lag time of 900 ns. Mass 
accuracy was calibrated externally using the All-in-l peptide molecular mass 
standard (Ciphergen Biosystems, Inc.). 

Data Analysis. The data analysis process used in this study involved three 
stages: (a) peak detection and alignment; (b) selection of peaks with the 
highest discriminatory power; and (c) data analysis using a decision tree 



algorithm. A stratified random sampling with four strata (PCA (T,/T,), PCA 
(Tj/T 4 ), BPH, and HM] was used 10 separate the entire data set into training 
and test data sets before the analysis. The training data set consisted of SELDI 
spectra from 167 PCA, 77 BPH, and 82 normal scrum samples. The validity 
and accuracy of the classification algorithm were then challenged with a 
blinded test data set consisting of 30 PCA, 15 BPH, and 15 normal samples. 

Peak Detection. Peak detection was performed using Ciphergen SELDI 
software versions 3.0 fi and 3.O. 5 The mass range from 2,000-40,000 Da was 
selected for analysis because this range contained the majority of the resolved 
protein/peplides. The molecular masses from 0-2,000 Da were eliminated 
from analysis because this area contains adducls and artifacts of the EAM and 
possibly other chemical contaminants. Peak detection involved (a) baseline 
subtraction. (b) mass accuracy calibration, and (c) automatic peak detection. 
The software program calculates noise, peak areu, and filter based on the 
criteria selected by the operator for data analysis. The settings used for this 
study were as follows: (a) fitting window width = 100 data points; (b) average 
noise = 10 points; (c) peak area calculated using the slope-based method; (d) 
low minimum valley depth = 10 times noise; (<>) high minimum valley 
depth = 0.5 times noise; (J) low and high sensitivity of peak height = 10 and 
2 times noise, respectively; (g) auto peak detection slider = 8 for mass range 
2-4 kDa, 1 1 for mass range 4-8 kDa, and 8 for mass range 8-40 kDa. An 
average of 81 peaks was detected in each spectrum. 

Peak Alignment. All of the labeled peaks from 772 spectra were exported 
from SELDI to an Excel spreadsheet. A PeakMiner algorithm, 6 developed 
in-house, was used to align peaks and perform statistical analysis. Peaks were 
first sorted by mass, and a mass error value was calculated for each peak. The 
mass error score, the measurement of mass difference between peak X and 
peak X + I, is calculated for each peak using (Mpx - Mpx +l)/Mpx, where 
Mpx is the mass value of peak X. For example, if the mass error score was 
<0.18%, peak X and peak X + 1 would align into one peak, representing the 
same protein in each sample. If the mass error was >0.18%, then peak X and 
peak X + 1, would be considered two distinct peaks. This is an iterative 
process throughout all of the labeled peaks. 

Feature Selection. The power of each peak in discriminating normal 
versus PCA, normal versus BPH, and BPH versus PCA was determined by 
estimating the AUC, which ranges from 0.5 (no discriminating power) to 1.0 
(complete separation). 

Decision Tree Classification. Construction of the decision tree classifica- 
tion algorithm was performed as described by Breiman et at. (14) with 
modifications, 7 using a training data set consisting of 326 samples (82 normal, 
77 BPH, and 1 67 PCA samples). Classification trees split up a data set into two 
bins or nodes, using one rule at a time in the form of a question. The splitting 
decision is defined by presence or absence and the intensity levels of one peak. 
For example, the answer to "Does mass A have an intensity less than or equal 
to X" splits the data set into two nodes, a left node for yes and a right node for 
no. This splitting process continues until terminal nodes or leaves are produced 
or further splitting has no gain. Classification of terminal nodes is determined 
by the group ("class") of samples (i.e., PCA, BPH, or HM) representing the 
majority of samples in that node. A "cost" function is calculated that reflects 
the heterogeneity of each node: -log L = -Xnjlog(pj) where L is the 
likelihood of the multinomial distribution, nj is the number of samples in class 
j, and pj is the probability of class j. Peaks selected by this process to form the 
splitting rules are the ones that achieve the maximum reduction of cost in the 
two descendant nodes. 

Statistical Analyses. The AUC was computed to identify the peaks with 
the highest potential to discriminate the three groups, based on the probability 
that the test result from a diseased individual is more indicative of disease than 
that from a nondiseased individual (15). A Bayesian approach was used to 
calculate the expected probabilities of each class in each terminal node (16), 
and their 95% confidence intervals were calculated using the posterior Dir- 
ichlet distribution (16). The 95% confidence intervals were calculated by 
generating and sorting 4000 samples for the posterior Dirichlet distribution, 
and the 100 th and 3900"' sample were considered as the lower and upper 
bounds of the 95% confidence intervals, respectively. Specificity was calcu- 
lated as the ratio of the number of nondisease samples correctly classified to 
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(he total number of nondisease samples, Sensitivity was calculated at the ratio 
of the number of correctly classified diseased samples to the total number of 
diseased samples. The PPV for the study population was calculated by dividing 
the number of true PCA positives by the sum of the number of true PCA 
positives plus the number of false I'CA positives. The NPV for the study 
population was calculated by dividing the number of true negative nondisease 
samples (BPI1/11M) by the sum of the number of false negative plus the 
number of true negative nondisease samples (BPH/HM). The PPV and NPV 
for PCA versus noncanccr (BPH/HM) in the general population were calcu- 
lated as follows: PPV (for population) = sensitivity * rho/[sensitivity * 
rho + (1 - specificity) * (I - rho)]; and NPV (for population) = specificity 
* (1 - rho/[specificity) * (1 - rho) + (I - sensitivity) * rho], where rho is 
prevalence in the population. 

Results 

Data Analysis. Peak detection using the SELDI software program 
detected 63,157 peaks in the 2-40-kDa mass range after analysis of 
772 spectra (386 spectra in duplicate, with approximately 8 1 peaks/ 
spectrum). Of these, 779 peaks were identified after the clustering and 
peak alignment process. The AUC was calculated for each of the 779 
peaks. No single peak was identified that had an AUC of 1.0, indi- 
cating that there was not a peak detected that alone could completely 
separate two groups (i.e., HM versus PCA, HM versus BPH, or BPH 
versus PCA) or three groups (PCA versus BPH versus HM). Of the 
779 peaks, 124 had an AUC > 0.62. Those with an AUC < 0.62 were 
considered irrelevant for classification. These 124 peaks identified in 
the training set were then used to construct the decision tree classifi- 
cation algorithm. Fig. 1 is a flow diagram that summarizes the process 
from peak detection to sample classification. The classification algo- 
rithm used nine masses between 4 and 10 kDa (4475, 5074, 5382, 
7024, 7820, 8141, 9149, 9507, and 9656 Da) to generate 10 terminal 
nodes (LI— L10; Fig. 2A). Once the algorithm identifies the most 
discriminatory peaks, the classification rule is quite simple. For ex- 
ample, if an unknown sample has no peak at mass 7819.75 ("root" 
node) but has a peak at mass 7024.02, then the sample is placed in 
terminal node LI and classified as PCA. If the sample is placed in L2, 
it will be assigned to BPH- Another example of this splitting process 
is shown in Fig. 2B, in which four masses between 5 and 10 kDa are 
used to assign 46 of the 167 PCA samples to terminal node L7. Based 
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set. A. diagram of decision tree analyses. The root node (lop) and descendant nodes are 
shown as ovals, and the terminal nodes (L1-L10) are shown as rectangles. The numbers 
in each node represent the classes [top number, number of HM (normal control) samples; 
middle number, number of PCA samples; bottom number, number of BPH samples]. The 
first number under the root and descendant nodes is the mass value followed by the peak 
intensity value. For example, the moss value under the root node is 7819.75 kDa, and the 
intensity is ^0. B, representative example of a SELDI spectrum showing the combination 
of four peak masses required to correctly classify the sample as PCA in the L7 terminal 
node. The arrows in the magnified panels identify the protein peaks used in the classifier, 
and the numbers 1-4 in the lop right corner indicate the order the decision tree takes in 
assigning the sample to the L7 terminal node. The first number under each panel is the 
mass, and the second number is the peak intensity. C, example of the reproducibility of 
the SELDI and dcci i f I I crura ample randomly selected and 

repeated IS months (S) after the initial SELDI analysis (A) showed similar spectra and 
were correctly classified to the appropriate terminal node by the decision tree algorithm; 
in this example either terminal node LI, L2, or L6. Nl, sample from a healdiy male donor; 
Bl, sample from a patient with BPH; CI, sample from a patient with PCA. 
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on the stochastic nature of reality, misclassification of a new sample 
cannot be ruled out even for a pure node that contains only one sample 
type, for example, L2, which contains only BPH samples. To obtain 
an idea of whether an unknown sample would be correctly classified 
or misclassified, the expected probability and 95% confidence level 
was calculated for each class in the 10 terminal nodes (Table 1). The 
expected probabilities for HM and PCA samples to be misclassified in 
L2, for example, are 1.67%. Although uot zero, the likelihood of HM 
or PCA samples being assigned to this node is extremely low; whereas 
BPH has a 96.67% chance of being correctly classified to L2 (with the 
95% confidence interval between 90.72% and 99.52%). The proba- 
bility of incorrect assignment of samples increases in nodes that 
contain few majority samples or when only a few samples are as- 
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signed to the node, as, for example, terminal nodes L3, L5, and L9 
(Fig. 1A). 

A summation of the classification results from the 10 terminal 
nodes is presented for the training and test sets in Table 2. The 
classification algorithm correctly predicted 93.5 1 97.59% of the sam- 
ples for each of the three groups in the training set (Table 2A), for an 
overall correct classification of 96%. The algorithm correctly pre- 
dicted 90% (54 of 60) of the test samples, with all 15 samples from 
HM, 93% (14 of 15) of the BPH samples, and 83% (25 of 30) of the 
PCA samples being correctly classified (Table 2B). Three of the 
misclassified HM cases in the training set had PSA values < 2.5 
ng/ml (i.e., 0.15, 0.76, and 1.52 ng/ml), considered a low-risk group, 
and the fourth case had a PSA of 3.02 ng/ml (i.e.. high-risk group). 
Therefore, no correlation for the misclassification of four of the HM 
cases with PSA levels could be made. 

The sensitivity and specificity of the classification system for 
differentiation of disease from the nondisease groups are presented 
in Table 2C. When comparing PCA versus noncancer (BPH/HM), 
the sensitivity was 83% (25 of 30), and the specificity was 97% (29 
of 30). A sensitivity of 83% was also obtained when comparing 
PCA versus HM (25 of 30) or PCA versus BPH (25 of 30), whereas 
Ihe specificity was 100% (15 of 15) for PCA versus HM and 93% 
(14 of 15) for PCA versus BPH. The PPV and NPV for the study 
population were 96.15% and 96.67%, respectively. When consid- 
ering an estimated 30% prevalence of PCA in the general popula- 
tion of men age 50 years or older (17), the PPV is 91.15%, and the 
NPV is 93.12%. 

Reproducibility. The reproducibility of SELDI spectra, i.e., mass 
location and intensity from array to array on a single chip (intra-assay) 
and between chips (interassay), was determined using the pooled 
normal serum quality control sample. Seven proteins in the range of 
3,000-10,000 Da observed on spectra randomly selected over the 
course of the study were used to calculate the coefficient of variance. 
The intra-assay and interassay coefficient of variance for peak loca- 
tion was 0.05%, and the intra-assay and interassay coefficient of 
variance for normalized intensity (peak height or relative concentra- 
tion) was 15% and 20%, respectively (data not shown). Masses that 
were within 0. 1 8% mass accuracy between spectra were considered to 
be the same. Most important was the observation that randomly 
selected samples, blinded to the person performing SELDI and rerun 
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months or even a year later, were correctly classified by the decision 
tree classification algorithm (Fig. 2Q. 

Discussion 

The current standard screening approach for PCA is a serum test for 
PSA, and if the test is positive, biopsies are obtained from each lobe 
of the prostate. Although the PSA test has a sensitivity of >90%, its 
specificity is only 25%. This low specificity results in subjecting men 
to biopsies of the prostate as well as considerable anxiety when they 
do not have PCA detectable by biopsy. With the SELDI profiling 
classification approach, an overall sensitivity of 83%, a specificity of 
97%, and a PPV of 96% were obtained in differentiating PCA from 
BPH and age-matched unaffected HM. Provided that this SELDI 
profiling classification system can be validated using a larger and 
more clinically diverse study set, this approach would have immedi- 
ate and substantial benefit in reducing the number of unnecessary 
biopsies. 

Our successful development of a diagnostic system that achieved a 
high PPV (96%) for the blinded test set is based on using a large, 
carefully chosen training set of randomly selected samples. All spec- 
imens were closely age matched. Serum samples from unaffected 
HM, identified as men with a negative DRE and PSA <4.0 ng/ml, 
were obtained from the general population during free prostate screen- 
ing clinics. Nevertheless, selecting a cancer-free control population 
for studies described herein is difficult. It is unusual for a man with a 
normal PSA and normal DRE to undergo a prostate biopsy to be 
certain that the controls are truly negative. About the best that can be 
done is to select healthy controls that have potentially the lowest risk 
for PCA. For this study, 86 of the 96 HM cases had PSA values <2.5 
ng/ml, which is considered a lower-risk group. The majority of the 
BPH patients had 4-10 ng/ml PSA and multiple negative biopsies, 
and the PCA patients had cancers ranging from small volume local- 
ized disease to local and distant metastatic disease and PSA values 
varying from 0 to >8000 ng/ml. Another important factor in the 
construction of a successful classification system was using an algo- 
rithm that could filter out the "noise" that is characteristic of mass 
spectrometry instruments, the spurious signals created by the EAM 
and chemical contaminates introduced in the assay, and the natural 
random daily fluctuations and sample-to-sample variability. This 
"normalization" process is critical in distinguishing peaks due to 
artifacts from the true peptide/protein peaks. It becomes even more 
important when considering that most all of the protein alterations 
between the cancer and noncancer cohorts are based on the overex- 
pression or underexpression of proteins and not solely on their pres- 
ence or absence. We believe that accurate and reproducible feature 
selection or peak "picking" algorithms with normalization functions is 
the most critical first step in developing a successful classification 
algorithm for the SELDI profiling data. 

It was encouraging that the three study cohorts could be separated 
based on the overexpression or underexpression of nine peptide/ 
protein masses. However, it was not surprising that multiple bio- 
markers would be required to effectively deal with the problem of 
tumor microheterogeneity that has plagued so many biomarker inves- 
tigations. A previous study from our laboratory (12) is, to the best of 
our knowledge, the first report describing the concept of SELDI 
protein profiling as a potential diagnostic approach. This study ob- 
served that the selection of a combination of multiple proteins re- 
solved by SELDI dramatically improved the detection rate of early- 
stage bladder cancer compared with a single marker (i.e., urine 
cytology). Although the differential analysis in this latter study was 
conducted by cluster analysis and laborious manual visual inspection 
of all spectra, it did, however, demonstrate the power of SELDI 



profiling to facilitate the discovery of better cancer biomarkers. Fur- 
thermore, it clearly illustrated the need for a bioinformatics algorithm 
to effectively deal with the high dimensionality of the SELDI data. 
Based on the results of this previous study, we have explored several 
different bioinformatics models to mine and analyze the large 
amounts of data generated from these clinical protcomic studies. The 
models have included purely biostatistical algorithms, genetic cluster 
algorithms, support vector machines, and decision classification trees. 
All have obtained between 83-90% accuracy in separating PCA from 
the noncancer (BPH/HM) samples. 8 The classification tree model was 
selected because it is easy to interpret and the results can be clearly 
presented compared with "black box" classifiers such as neural net- 
works and biostatistical algorithms, specifically with regard to the 
problems associated with the deconvolution steps required in identi- 
fying the protein peaks used in the classifiers. With the decision tree 
algorithm, the protein peaks used in the classifier are easily attainable 
by examination of the rules, and these rules are easily validated by 
examination of the SELDI processed spectra. Further proof of concept 
that coupling an artificial intelligent learning algorithm to analyze 
SELDI profiling data has potential as a diagnostic test is the recent 
report describing the use of a modified generic algorithm that 
achieved a PPV of 94% in differentiating ovarian cancer from benign 
ovarian disease and healthy unaffected women (1 8). The discriminator 
pattern for classification of ovarian cancer in the study of Petricoin el 
al. (18) consisted of five protein masses of 534, 989, 21 1 1, 2251, and 
2465 Da. Although they used hydrophobic chip chemistry, which 
might be expected to bind some different proteins than those that 
would bind to the IMAC-3Cu chip used in the present study, it is 
interesting to note that the masses are distinctly different from those 
used in the prostate classification system. This suggests that the 
SELDI protein fingerprint profiling approach is detecting different 
protein patterns for each type of cancer. Studies in progress in our 
laboratory strongly suggest that this may be the case. We have 
observed that SELDI profiles of breast cancer, ovarian cancer, bladder 
cancer, and leukemia are different from each other and from the 
prostate classification profile described in this report. 9 To assure the 
robustness of our diagnostic system, the prostate classification algo- 
rithm is being challenged with non-PCAs and non-prostate diseases to 
determine that the protein profiling classification algorithm is specific 
for PCA. A similar scheme will be required of any disease-specific 
classification system. 

One of the goals of this study was to identify markers in the prostate 
proteome that could potentially be used for early detection of cancer. 
Ongoing studies in our laboratory evaluating longitudinal serum sam- 
ples over a 5-10-year period suggested that PCA may be suspected 5 
or more years earlier than by PSA testing. 10 If validated with a larger 
number of patients, such studies will support the SELDI classification 
system as an early diagnostic test. However, to effectively apply this 
classification system for early detection, it will be essential to identify 
other biomarkers that can distinguish the aggressive cancers, i.e., 
clinically important cancers, from nonaggressive cancers. Current 
evidence suggests that preoperative serum PSA <10 ng/ml is not a 
useful biomarker for predicting the presence, volume, grade, or rate of 
postoperative failure (4, 19). Thus, there is an urgent need for a better 
biological marker than PSA and all its molecular forms have been able 
to provide. A marker proportional to the volume of Gleason grade 4/5 
(undifferentiated cancer) represents a critical need to more logically 
direct therapy tailored to tumor biology. Studies ai 
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laboratory to evaluate SELDI serum spectra of pre- and postprosta- 
teclomy samples from patients who, after treatment, have biochemical 
evidence for recurrent disease in an effort to identify the biomarkers 
or risk factors that signal an aggressive cancer. 

The successful use of the prostate classification system described 
herein relics entirely on the protein fingerprint pattern of the nine 
masses. Because these masses were found lo be reproducibly reliably 
detected, only the mass values are required to make a correct classi- 
fication or diagnosis. Knowing their identities for the purpose of 
differential diagnosis is not required. However, because knowing their 
exact identities will be essential for understanding what biological 
role these peptide/proteins may have in the oncogenesis of PCA, 
potentially leading to novel therapeutic targets, efforts are under way 
to purify, identify, and characterize these protein/peptide biomarkers. 
Furthermore, knowing their identities will be essential for producing 
antibodies for development of either classical or SELDI immunoas- 
says, similar to the single and multiplex formats we described previ- 
ously for the quantitation of PSA and prostate-specific membrane 
antigen (12, 20). The SELDI immunoassay format provides an alter- 
nate platform for quantitation of multiple biomarkers. 

The high sensitivity, specificity, PPV, and NPF obtained by the 
serum protein profiling approach presented in this study demonstrate 
that SELDI protein chip mass spectrometry combined with an artifi- 
cial intelligence classification algorithm can both facilitate discov- 
ery 1 1 of better biomarkers for prostate disease and provide an inno- 
vative clinical diagnostic platform that has the potential to improve 
the early detection and differential diagnosis of PCA. 
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ABSTRACT 

Purpose: The objective of this study was to discover 
protein biomarkers that differentiate malignant from non- 
malignant cell populations, especially early protein alter- 
ations that signal the initiation of a developing cancer. We 
hypothesized that Surface Enhanced Laser Desorption/ 
Ionization-time of flight-mass spectrometry-assisted protein 
profiling could detect these protein alterations. 

Experimental Design: Epithelial cell populations [be- 
nign prostatic hyperplasia (BPH), prostate intraepithelial 
neoplasia (PIN), and prostate cancer (PCA)] were procured 
from nine prostatectomy specimens using laser capture 
microdissection. Surface Enhanced Laser Desorption/ 
Ionization-time of flight-mass spectrometry analysis was 
performed on cell lysates, and the relative intensity levels of 
each protein or peptide in the mass spectra was calculated 
and compared for each cell type. 

Results: Several small molecular mass peptides or pro- 
teins (3000-5000 Da) were found in greater abundance in 
PIN and PCA cell lysates. Another peak, with an average 
mass of 5666 Da, was observed to be up-regulated in 86% of 
the BPH cell lysates. Higher levels of this same peak were 
found in only 22% of the PIN lysates and none of the PCA 
lysates. Expression differences were also found for intracel- 
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lular levels of prostate-specific antigen, which were reduced 
in PIN and PCA cells when compared with matched nor- 
mals. Although no single protein alteration was observed in 
all PIN/PCA samples, combining two or more of the mark- 
ers was effective in distinguishing the benign cell types 
(normal/BPH) from diseased cell types (PIN/PCA). Logistic 
regression analysis using seven differentially expressed pro- 
teins resulted in a predictive equation that correctly distin- 
guished the diseased lysates with a sensitivity and specificity 
of 93.3 and 93.8%, respectively. 

Conclusions: We have shown that the protein profiles 
from prostate cells with different disease states have dis- 
criminating differences. These differentially regulated pro- 
teins are potential markers for early detection and/or risk 
factors for development of prostate cancer. Studies are un- 
der way to identify these protein/peptides, with the goal of 
developing a diagnostic test for the early detection of pros- 
tate cancer. 

INTRODUCTION 

Prostate cancer is the most common cancer and, second to 
lung cancer, causes the greatest number of cancer deaths in 
American males (1). The PSA 3 serum test has contributed to 
earlier detection, however, 65-75% of moderately elevated PSA 
levels are attributed to BPH, often resulting in unnecessary 
biopsies (2). Several approaches have been undertaken to im- 
prove the PSA test such as measuring PSA velocity (3), PSA 
density (4), and assessing ratios between free, complexed, and 
total PSA serum values with various degrees of success (5). 
Combinations of markers such as free PSA, IGF-I, and IGF- 
binding protein 3 have resulted in improved diagnostic discrim- 
ination between BPH and prostate cancer (6). It is becoming 
increasingly clear that because of the inherent molecular heter- 
ogeneity and multifocal nature of prostate cancer (7), additional 
improvement in early detection, diagnosis, and prognosis will 
likely require the measurement of a panel of biomarkers. 

The proteome is the full complement of proteins that reg- 
ulate the physiological and pathophysiological phenotype of a 
cell. Because proteins initiate all cell functions and pathways, 
identifying differentially expressed proteins between normal 
and pathological states can lead to a better understanding of the 



3 The abbreviations used are: PSA, prostate-specific antigen; BPH, 
benign prostatic hyperplasia; IGF, insulin-like growth factor; LCM, 
laser capture microdissection; MS, mass spectrometry; TOF, time of 
flight; SELDI, Surface Enhanced Laser Desorption/Ionization; PIN, 
prostate intraepithelial neoplasia, PCA, prostate cancer; IMAC, Immo- 
bilized Metal Affinity Capture; N, normal. 
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cellular mechanisms involved in disease. Some proteins are 
down-regulated and others are up-regulated with the onset of 
disease, depending on a protein's specific function, whereas 
others undergo disease-specific posttranslational modifications 
(8-10). The identification of changes in protein expression and 
modification that occur in the early stages of a developing 
cancer could lead to the discovery of protein biomarkers 
and novel strategies for the improvement of early detection, 
diagnosis, and therapy of cancer. Therefore, examining the 
proteome of a cell holds great potential for the discovery of new 
biomarkers. 

As a result of the microheterogeneity of organ-based can- 
cers, studies of pure cell populations are required to achieve 
precision in the search for disease-associated biomarkers. LCM 
microscopes have been used successfully for the procurement of 
pure populations of cells for genetic analysis (11, 12), protein 
expression changes in cancer cells using two-dimensional elec- 
trophoresis (13), and MS (14-16). Advances in MS have lead to 
the evolution of several proteomic applications: from the map- 
ping of peptide digests of proteins isolated from two-dimen- 
sional electrophoresis to direct and rapid proteome profiling of 
cells and body fluids. For example, matrix-assisted desorption 
ionization-TOF-MS has been used to look for protein changes in 
breast cancer cell lines (17) and in the serum of cutaneous 
melanoma patients (18). In addition, tandem MS systems are 
capable of extracting peptide sequence information for sequence 
tagging and protein identification (19). Another innovative MS 
technology, SELDI, has been used to compare the patterns of 
protein expression in two physiological states of Yersinia pestis 
(20) and in the profiling of amyloid (3 peptide variants (21). Our 
laboratory has successfully applied SELDI to the identification 
of specific protein changes in the urine of bladder cancer pa- 
tients (22) and the detection of prostate cancer-associated bi- 
omarkers, PSA, prostate-specific membrane antigen, prostate 
acid phosphatase, and prostate secretory protein in cell lysates, 
serum, and seminal plasma (16). Using the various affinity 
surfaces of Proteinchip arrays, SELDI can reduce complex 
protein mixtures to sets of proteins with common properties. 
The advantage of the SELDI protein profiling method is the 
ability to simultaneously detect multiple protein changes with a 
high degree of sensitivity (pmol to amol; Ref. 23) in a rapid high 
throughput process. Clear spectra are obtained with predomi- 
nately singly charged ions and mass deviations of <0.02% for 
internally calibrated spectra (24, 25). This precision makes it 
possible to delineate very small proteins and peptides, as well as 
differential posttranslation modifications such as phosphoryla- 
tion and glycosylation (26). Recently, SELDI protein profiling 
has been shown to provide reproducible and specific protein 
patterns of LCM cell lysates harvested from different cancer 
types and grades (15, 27). 

This report describes the combinatorial use of LCM and 
SELDI technologies to detect protein differences in cell lysates 
of pure populations of normal, benign (BPH), premalignant 
(PIN), and malignant prostate (PCA) cells. The objectives of 
this study were to discover potential biomarkers that could be 
used to differentiate malignant from the nonmalignant cell pop- 
ulations, especially early protein alterations that signal the ini- 
tiation of a developing cancer. The latter would be especially 
useful as potential markers for early detection and/or as risk 



factors for development of prostate cancer. Differential expres- 
sion of several individual protein peaks was observed for BPH, 
PIN, and PCA epithelial cells with respect to the expression 
levels found in matched normal epithelial cells. Combinations of 
these signature or differentially regulated proteins/peptides 
could distinguish PCA and PIN from normal and BPH. How- 
ever, in most cases, it was difficult to differentiate PCA from 
high-grade PIN. Thus, these protein alterations could represent 
early signals of a developing malignant lesion and may be useful 
as markers of early detection. 

MATERIALS AND METHODS 

Patient Specimens. Prostate tissues were procured from 
consenting patients undergoing radical prostatectomy. The age 
of the patients ranged from 44 to 68 years and consisted of five 
Caucasians and four African Americans. The tissues were pro- 
cessed immediately and stored in the Virginia Prostate Center's 
Bio-repository. Tissue pieces harvested for LCM were immedi- 
ately embedded in optimal cutting temperature compound and 
stored at -80°C. One cryosection was H&E stained and exam- 
ined by a pathologist to identify cells of interest for microdis- 
section. Mirrored-stained sections, fixed in formalin and paraf- 
fin embedded, were also used to further aid in the identification 
of specific cell types. Additional serial frozen sections at 8 u.m 
were used for microdissection. 

LCM. Pure populations of normal luminal epithelia, 
BPH, PIN, and PCA epithelial cells were microdissected from 
frozen tissue sections using the PixCell II Laser Capture Micro- 
dissection Microscope (Arcturus Engineering, Inc., Mountain 
View, CA) essentially as described by Emmert-Buck et al. (28). 
The procedure for staining frozen sections for LCM was fol- 
lowed with slight modifications: the hematoxylin step was omit- 
ted and protease inhibitors (Complete; Roche Biomedical Indi- 
anapolis, IN) were added to the staining baths. A total of 1000 
laser shots totaling 3000-6000 cells was procured for each cell 
type. Matched benign and diseased epithelial cell types were 
harvested from each prostate sample. In some cases, stroma 
cells were also procured from the same section directly adjacent 
to the cells of interest. Each cell population was estimated to be 
>98% homogeneous based on careful examination of captured 
cells by the pathologist. Samples were standardized by total 
number of laser shots, and duplicate samples were captured 
from the same areas of each serial section to check reproduc- 

Cell Lysates and SELDI Proteinchip Array Binding. 

Cell lysates were immediately prepared after microdissection by 
adding 4 |xl of a lysis buffer containing 20 mki HEPES (pH 8.0) 
with 1% Triton X-l 00 directly on the LCM cap. Each lysate was 
diluted 1:10 in PBS buffer, giving a total volume of 40 pi 
Lysates were vortexed for 10 min at 4°C and centrifuged briefly 
to remove cellular debris. The supernatant was added to an 
IMAC3 Proteinchip Array (Ciphergen Biosystems, Inc., Fre- 
mont, CA), pretreated with 100 dim CuSO„ following the man- 
ufacturer's instructions. This surface was chosen because it 
produced the most robust spectra of the LCM lysates and for its 
ability to bind phosphorylated proteins. A bioprocessor (Cipher- 
gen Biosystems, Inc.) was fitted on top of the chip arrays to 
permit the addition of the 40-pJ sample. To control for variation, 
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cell lysates harvested from each prostate tissue were analyzed 
on a single biochip. The array was then incubated with the 
diluted lysatc overnight at room temperature on an orbital 
shaker. After removal of" the lysate, each spot was washed twice 
with PBS, followed by a final water rinse. 

SELDI Analysis. The arrays were allowed to air dry, and 
a saturated solution of sinapinic acid (Ciphergen Biosystems, 
Inc.) in 50% (v/v) acetonitrile and 0.5% (v/v) trifluoroacetic 
acid was added to each spot. TOF mass spectra were generated 
in a Ciphergen Protein Biology System II by averaging 120 laser 
shots collected in the positive mode at laser settings of 225 and 
280. Data were calibrated externally using purified peptide and 
protein standards. 

Protein Profile Evaluation and Peak Expression Scor- 
ing. Spectra were analyzed with the Ciphergen Peaks 2.1 
software and relative abundance for each peak were calculated 
as follows. The relative abundance of the proteins was subdi- 
vided into three classes: low (+), 1-30% of spectral scale; 
medium (++), 31-60%; and high (+ + +), 60-100% and an- 
alyzed for each matched set of cell types. Numerical values were 
then assigned to the abundance levels (i.e., (+), 33; (++), 66; 
and (+ ++), 100) and averaged for each cell type to represent 
the protein expression between prostate cell types in graphical 

Statistical Analysis. Sensitivity is defined as the per- 
centage of diseased (BPH/PIN/PCA) cell types for which the 
biomarker of interest is present (true positive/total number of 
diseased lysates x 100). Specificity is defined as the percentage 
of cell types for which the biomarker of interest is not positive 
(true negative/total number of lysates without disease X 100). 
The statistical significance of the differences in peak expression 
scores between all possible pairs among the four cell types was 
calculated using the Wilcoxon signed rank test. A logistic re- 
gression analysis was then performed using the most significant 
differentially expressed proteins. 

RESULTS 

Sample Harvesting and SELDI Profile Evaluation. 
Pure populations of organ-matched benign (normal or BPH), 
PIN (high grade), and PCA epithelial cells were obtained from 
nine prostatectomy specimens examined. In four of the prostate 
specimens, all four cell types were identified and harvested, and 
PIN cells were obtained for all nine prostate tissues. The total 
number of cell types for each group was as follows: eight 
normal, seven BPH, nine PIN, and seven PCA. An average of 
5000 cells was microdissected in duplicate for each cell type, 
resulting in 62 cell lysates analyzed by SELDI-TOF-MS. In two 
samples, we were able to microdissect two different foci of PIN 
and PCA. Additionally, adjacent stroma cells were microdis- 
sected from a selected subset of tissues to compare with the 
epithelial cell profiles. 

Visual Analysis of SELDI Data Revealed Differential 
Protein Profiles. Processing the lysates on an immobilized 
metal affinity capture surface pretreated with CuS0 4 resolved 
between 50 and 90 protein or peptide peaks in the mass range of 
3 to 100 kDa. Fig. 1 is a representative spectrum of the protein 
profile of a PCA cell lysate. The advantages of the SELDI 
technology over two-dimensional electrophoresis in resolving 



molecular mass protein or peptide species below mlz 10,000 (10 
kDa) is evident. The protein profiles of each set of matched 
lysates were compared visually to identify differences. Expres- 
sion profiling of the samples revealed several protein pattern 
differences, indicating up- and down-regulation or possible al- 
tered protein processing between the prostate cell types. Fig. 1A 
is the SELDI spectra and gel-view, showing a differentially 
expressed group of proteins between 4000-6000 Da in epithe- 
lial cells obtained from the same prostate tissue specimen. Three 
peaks (4030, 4358, and 4753 Da) are present or up-regulated in 
the PIN and PCA lysates. Fig. IB is a composite SELDI gel- 
view of matched cell types obtained from three different pros- 
tatectomy specimens exhibiting increased expression of a peak 
at an average mass of 4749 Da in the PIN and PCA samples. 
Duplicate samples exhibited a high degTee of reproducibility. In 
contrast, regions of heterogeneity were present in the protein 
profiles derived from two different foci of PCA in patient 2 and 
two foci of PIN in patient 3. However, overexpression of the 
4749 Da protein is still observed. Profile differences were also 
found in the BPH cell lysates. Fig. 2C is an example of a peak 
at 5666 Da, which appears to be up-regulated in the BPH 
epithelial cells when compared with the spectra of the other 
matched cell types. 

Differential Expression of Intracellular PSA Was Ob- 
served between Prostate Cell Types. To assess the potential 
of the SELDI technology, initial studies in our laboratory fo- 
cused on the detection of known prostate cancer markers (16). 
Prostate-specific membrane antigen, prostate acid phosphatase, 
prostate secretory protein, and PSA were detected from cell 
lysates and body fluids. In this study, a peak of average mass 
28.4 kDa was detectable in the protein profiles of epithelial cells 
using the IMAC surface pretreated with CuS0 4 . This mass is 
consistent with the molecular mass of free PSA (29) and has 
been confirmed as such in our laboratory from LCM prostate 
cells lysates using a SELDI immunoassay (data not shown). 

To determine how the expression of intracellular PSA 
differed between each cell type, the PSA peaks were compared 
in the protein profiles of organ-matched sets of lysates. Differ- 
ential PSA expression between the organ-matched cell types 
was observed. In 5 of 9 of the PIN samples and 4 of 7 of the 
PCA samples, the PSA peak was reduced when compared with 
matched normal epithelia. An example of this decrease in intra- 
cellular PSA levels can be seen in Fig. 3. A large PSA peak at 
28,393 Da was present in the normal prostate epithelia but, as 
expected, is absent from adjacent stroma cells. The PIN and 
PCA cells had a greatly reduced PSA peak. Normal epithelial 
cells procured directly adjacent to the PCA foci, however, 
exhibited a large amount of intracellular PSA. Serum PSA 
values for patients donating tissue for this study were between 
5.5 and 26.2 ng/rnl. Unfortunately, no correlation could be made 
between relative intracellular PSA levels in PCA cells observed 
in the SELDI profiles and serum PSA values. 

Identification of Differentially Expressed Peaks of Di- 
agnostic Value. A visual comparative analyses of the SELDI 
profiles suggested that several proteins exhibited differential 
regulation between the four cell types. However, to more pre- 
cisely select proteins with high biomarker potential, it was 
necessary to standardize peak levels throughout the matched 
populations. The relative abundance of the peaks was subdi- 
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Fig. 1 Representative SELDI spectra 
of one LCM sample from PCA epithe- 
lial cells in m/z ratios of 3000-10000, 
10000-20000, and 20000 - 80000. 



vided into three classes: low (+), 1-33% of spectral scale; 
medium (++), 33-66%; and high (+ + +), 66-100%. Each 
individual spectrum was expanded in specific mass ranges to 
assist in determining the relative levels of each observed peak. 
An in-house computer program was used to cluster the peaks to 
obtain an average mass for each peak within a mass error range 
of 0.15%. Of an average of 70 peaks commonly observed, 14 
(21%) of these peaks displayed some expression differences 
(up- or down-regulation or altered processing) between the 
epithelial cell profiles (see Table 1). Most species in this mass 
range were present in epithelial cell types only with the excep- 
tion of peaks at 3448 Da and the 4361 Da peak, which were also 
present in adjacent stroma cells. The scores in Table 1 were 
converted to numerical values and plotted in a histogram show- 



ing the average differences in expression levels between cell 
types subtracted from the expression level in normal epithelial 
(see Fig. 4). Thus, overexpression and underexpression for each 
peak listed was normalized and evaluated for BPH, PIN, and 
PCA cells. 

Proteins Overexpressed in BPH, PIN, and PCA Identi- 
fied from SELDI Profiles. Generally, the PCA and PIN lev- 
els of expression profiles were similar. As seen in Fig. 4, there 
were higher levels of a group of small molecular mass peptides/ 
proteins between 3000 and 5000 Da in the PIN and PCA 
profiles. The first of these, 3448 Da, was found at increased 
levels in the BPH, PIN, and PCA samples. Three peaks (4036, 
4361, and 4749 Da) were overexpressed in PIN and PCA with 
the highest level of abundance in PCA lysates. A summary of 
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Fig. 2 A, representative spectra and 
gel-view* of matched prostate epi- 
thelial cell lysates (normal BPH, 
PIN, and PCA) showing protein al- 
terations in the range of 4000-5000 
Da (m/z). (Arrows indicate overex- 
pressed peaks in PIN and PCA). 
*Gray scale display of the raw spec- 
tra called gel-view because it looks 
like a stained one-dimensional elec- 
trophoresis gel. ,8, SELDI gel-view 
protein profiles of lysates prepared 
from cells procured from different 
prostatectomy specimens. The box 
identifies a peak with an average 
mass of 4749 Da that appears to be 
overexpresscd in PIN and PCA epi- 
thelial cells. Replicate samplcs(*) 
show good reproducibility of the 
protein patterns. The different pro- 
tein patterns observed for two differ- 
ent PCA foci in patient 2 and the two 
PIN foci in patient 3 may be the 
result of genetic heterogeneity. C, 
representative spectra and gel-views 
of matched prostate epithelial cell 
lysates showing increased expres- 
sion of a peak at mlz of 5666 in the 
BPH sample. N, normal. 
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Fig. 3 Example of protein peaks found between 20,000-70,000 Da. 
Differential expression of a peak at 28,400 Da thought to be PSA was 
observed in epithelial cell lysates but absent in lysates from adjacent 
stromal cells. The intracellular-free PSA appears to be underexpressed 
in the PIN and PCA epithelial cell lysates. 



the percentage of each cell type exhibiting those most signifi- 
cantly increased or decreased proteins is presented in Table 2. 
The peaks at 4036 and 4361 Da were overexpressed in 71% of 
the PCA lysates and 44% of the PIN samples. Additionally, two 
peaks (4639 and 2418 Da) were specific to PCA cell lysates but 
were seen in only 43% of PCA samples tested. The peak at 4749 
Da was up-regulated in 67 and 57% of PIN and PCA lysates, 
respectively, with the highest overall peak expression value in 
the PCA lysates. Overexpression of a 14,696 Da peak was 
present in 56% of the PIN cell lysates. Also of note was a large 
increase in expression of a peak at 5666 Da found in 86% of the 
BPH profiles. Two high molecular mass proteins (48,208 and 
53,836 Da) were also found to have increased expression in the 
BPH cells. 

The significance of the expression differences between 
cell types was evaluated using the Wilcoxon signed rank test 
(Table 3). Differences were found to be significant (P < 



0.05) for BPH versus PIN and BPH versus PCA for overex- 
pression of the 4036 Da peak. A comparison of N versus PCA 
approaches significance (P = 0.059) for this peak. Addition- 
ally, significant differences were observed for N versus PCA 
and BPH versus PIN in overexpression of the 4361 Da peak. 
Overexpression of the 4749 Da peak was significant for N 
versus PIN and approaches significance for BPH versus PIN. 
The 5666 Da protein was significantly up-regulated when 
comparing N versus BPH and, interestingly, is significantly 
down-regulated in BPH versus PIN and BPH versus PCA. No 
correlation could be made between age or race of the patients 
and the appearance of differentially expressed or processed 
peaks. 

Proteins Underexpressed in BPH, PIN, and PCA Pro- 
files. As seen in Fig. 4, a few proteins were underexpressed in 
BPH, PIN, and PCA cells when compared with the expression 
levels in normal epithelial cells. Expression of the 5666-Da 
peak, which was overexpressed in BPH profiles, was reduced in 
22% of PIN and 71% of PCA cell lysates at significant levels 
(see Tables 2 and 3). In addition, decreased expression of an 
1 1,744-Da protein and PSA (28,442 Da) was found in the BPH, 
PIN, and PCA cell lysates. The reduced expression of intracel- 
lular PSA was found in 56 and 57% of PIN and PCA cell 
lysates, respectively. Furthermore, this underexpression of PSA 
was significant in N versus PIN and approaches significance 
(P = 0.066) for N versus PCA (Table 3). 

Biomarker Combination of Identified Peaks Improved 
Prediction of Cell Lysate Disease State. Because no single 
peptide or protein was discovered to be differentially expressed 
in all of the PIN or PCA profiles, various combinations of 
selected proteins were evaluated to identify a panel of markers 
(expression levels) that could improve disease classification of 
each specific cell type (Table 4). A biomarker combination was 
classified as positive if any marker in the combination was 
present in a sample and negative if none of the markers were 
detected in a specimen. Because most of these markers were 
overexpressed in both PCA and PIN, the combinations did not 
improve the specificity for each of the diseased cell types when 
evaluated individually. 

Better discrimination was achieved by combining the be- 
nign cell types (normal, BPH) and comparing them to diseased 
cell types (PIN or PCA). The sensitivity and specificity of each 
marker for benign versus disease were then calculated both 
individually and in additive combinations. By combining mark- 
ers 1 (p4036) and 3 (p4639), there was an improvement in the 
sensitivity for PCA from 71 to 86% while maintaining a spec- 
ificity of 100% for PIN or PCA combined. This combination of 
markers also identified 44% of the PIN samples. Combining 
marker 2 (p4361 ) and marker 3 (p4639) increased the sensitivity 
for PCA to 100% while the specificity decreased to 87% for PIN 
and PCA. By combining markers 2 and 4 (p4749), 100% of the 
PCA and 89% of the PIN samples could be correctly identified 
with a slight decrease in specificity. Combinations involving 
three or more of the markers did not improve the overall 
specificity and sensitivity. The 5666-Da protein (marker 5, 
Table 4) had a sensitivity of 86% and specificity of 88% for 
detecting BPH. This marker also provided some discrimination 
between BPH and PIN lysates having 22% sensitivity and 68% 
specificity for PIN epithelia. 
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"Relative expression levels of proteins: +, 1-33% of SELDI spectral scale; ++, 33-66%; and + + +, 66-100%, 0, peak not detected. 
* N, normal. 



Multivariable analysis of the peak expression data was 
evaluated to determine whether the simultaneous overexpres- 
sion or underexpression of several proteins in combination 
could be used to predict benign versus diseased cell lysales. 
Logistic regression analysis was performed with the seven most 
significant differentially expressed peaks [4,036, 4,361, 4,413, 
4,639, 4,729, 5,666, and 28,422 Da (PSA)]. As seen in Table 5, 
a predictive equation based on diseased (PIN/PCA) versus be- 
nign (normal/BPH) cell lysates resulted in 93.3% specificity and 
93.8% sensitivity for PIN or PCA cell lysates. Therefore, these 
biomarkers in combination may have clinical value, especially if 
detectable in biopsy or body fluid samples 

DISCUSSION 

The development of cancer is a multistep process encom- 
passing multiple events involving oncogenic and tumor suppres- 



sor gene products. These events can occur pre- or posttransla- 
tionally and will be reflected in differential changes in a myriad 
of proteins. Analyzing and interpreting the proteomic changes 
that occur in prostate disease progression is a daunting task 
made even more difficult by the biological heterogeneity of the 
disease. Advances in TOF-MS, resulting in the SELDI technol- 
ogy, have provided an approach for the sensitive and direct 
analysis of proteins in complex biological samples. Previous 
studies in this and other laboratories have demonstrated the 
successful application of this technology to profile a few mi- 
crodissectcd prostate cell specimens (15, 16, 30). In this study, 
we combined LCM with SELDI to generate protein profiles 
from 62 prostate cell lysates derived from nine prostatectomy 
specimens. Our results demonstrate that the protein profiles of 
normal prostate epithelia, BPH, PIN, and PCA display discrim- 
inating differences. 
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Table 2 Percentage of samples (BPH/PIN/PCA) displaying differential expression of selected proteins when compared with matched 
normal samples 



Molecular mass (Da) Overcxprcssion Undcrcxpression Ovcrexprcssion Undcrexprcssion Ovcrexprcssion Undcrcxpression 



4,639 
4,749 
5,666 
8,445 
11,744 
14,696 
24,184 
28,422 
48,308 
53,830 



Protein extracts prepared from prostate cell types were 
analyzed using an IMAC3 protein biochip pretreated with 
CuS0 4 . On average, 70 protein peaks were detected from the 
cell lysates using this surface. Overall, there were a relatively 
large number of common peaks in the benign and diseased 
epithelial cell profiles and very few peaks that would, based on 
presence or absence, be candidate biomarkers for the disease 
progression and/or diagnosis of prostate cancer. It was therefore 
determined that calculating expression levels of peaks would 
enhance the significance of the results and identify possible 
expression differences between the samples. Peak abundance 
levels were calculated for diseased cell profiles as compared 
with levels found in matched benign cell types. Of the common 
70 peaks observed, 15 (21%) of these peaks displayed dysregu- 
lation in the diseased profiles. 



Several small molecular mass peptides or proteins (3000- 
5000 Da) had increased expression levels in the PIN and PCA 
cell extracts. Although, the clusters of peaks in this range could 
originate from proteolysis and cleavage products of larger pro- 
teins, they nonetheless were consistently detected in common 
cell types and were considered part of the general profile. The 
chymotrypsin-Iike activity of PSA has been shown to facilitate 
the proteolysis of semenogelin from seminal plasma (31), and 
IGF-binding protein 3 (32). Such stable cleavage products of 
proteins may be indicative of changes occurring in the prostate 
disease cycle. Furthermore, because the IMAC surface can bind 
phosphorylated peptides or proteins, it is feasible that some of 
the changes observed are attributable to differential phosphoryl- 
ation of the proteins in the diseased cell types. Of interest in this 
study, two peaks were identified with an average mass of 4827 
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Peak (Da) N a _v.»._BPH_ _N vs. PIN N vs. PCA BPHjw. PIN BPH vs. PC A _ PIN vs. PCA 



4,036 


0.317 


0.083 


0.059 


0.025'' 


0.039* 


0.102 


4,361 


0.564 


0.059 


0.038* 


0.046'' 


0.102 


0.083 


4,413 


0.083 


0.157 


0.317 


0.655 


1.000 


0.317 


4,639 


1.000 


1.000 


0.083 


1.000 


0.157 


0.083 


4,749 


0.317 


0.034* 


0.102 


0.059 


0.109 


0.564 


5,666 


0.038* 


1.000 


0.059 


0.024' 


0.039' 


0.408 


28,422 (PSA) 


0.083 


0.041' 


0.066 


0.680 


0.180 


0.564 



* Significant Ps (£0.05) based on overexpression as compared with N or BPH. 
c Significant Ps (£0.05) based on undcrcxprcssion as compared with N or BPH. 



Table 4 Sensitivity an 


d specificity of sin 


gie and combinations of markers for the detection of i 


diseased cells 




Sensitivity 


Specificity 




Sensitivity/Specificity 




PCA 


PIN 


PCA 


PIN 


PIN or PCA 


(Marker) expression" 












(1) increase of m/z 4036 


71% 


44% 


83% 


77% 


56%/l00% 


(2) increase of m/z of 4361 


71% 


44% 


75% 


68% 


S6%/87% 


(3) presence of m/z 4639 


43% 


0% 


100% 


N/A 


19%/100% 


(4) increase of m/z of 4749 


57% 


67% 


71% 


71% 


63%/94% 


Combination 












1 + 2 


86% 


67% 


67% 


64% 


75%/87% 


1 + 3 


86% 


44% 


83% 


77% 


63%/100% 


2 + 3 


100% 


56% 


75% 


68% 


56%/87% 


1 + 4 


86% 


78% 


67% 


68% 


81%/94% 


2 + 4 


100% 


89% 


54% 


55% 


94%/80% 


3 + 4 


86% 


67% 


71% 


68% 


75%/94% 


1+2 + 3 


86% 


67% 


67% 




81%/94% 


1+2 + 4 


100% 


89% 


54% 


59% 


94%/80% 




Sensiti 


vity 


Specificity 








BPH 


PIN 


BPH 


PIN 




(Marker) expression 












(5)* increase of m/z 5666 


86% 


22% 


88% 







" The increase in abundance of each marker was determined from matched normal or BPH epithelial cell lysates. 

* The sensitivity and specificity of a marker found in abundance in BPH samples (5) when compared with PIN and PCA samples. 



Table 5 Classification table based on logistic regression analysis 
using expression levels of seven proteins: [p4036, p4361, p4413, 
p4639, p4749, p5666, and p2842 2 (PSA)] 





Predicted 






(PIN/PCA) 




Observed 


N" Y" 


% correct 


PIN/PCA N 15 


14 1 


93.3 (specificity) 


Y 16 


1 15 


93.8 (sensitivity) 






93.5 (overall) 



" N, no; Y, yes. 



Da ± 26.5 (common in the benign cell types) and 4749 Da ± 
26. 1 (higher abundance in PIN/PCA; Fig. 2A). These two peaks 
may represent a dephosphorylation event occurring in transition 
from benign to PIN/PCA. The average mass shift between these 
proteins (78 Da) is close to the calculated mass shift of 79 Da for 
a phosphorylation event. Prominent examples of aberrant phos- 
phorylation of proteins found in cancer studies include extra- 
cellular signal-regulated kinase 1/2 in breast cancer (33) and 
androgen receptor in prostate cancer (34). 



It is also quite possible that these small molecules could be 
intact functional proteins or peptides, examples of which include 
prohormones, growth factors, amidated peptides, and defensins. 
In a recent study by Rocchi et al. (35), PC-3 and Dul45 prostate 
cancer cell lines were found to produce and secrete a multifunc- 
tional amidated peptide (adrenomedullin, molecular mass ~6 
kDa). In the same study, increased levels of adrenomedullin 
immunostaining were found in PCA epithelia when compared 
with normal epithelia. The activity of the enzyme peptidylgly- 
cine ct-amidating monooxygenase was also demonstrated in 
prostate cancer cell lines. This enzyme produces a-amidated 
bioactive peptides from their inactive glycine-extended precur- 
sors. The importance of the role of these small molecular mass 
proteins may have previously been overlooked as a result of the 
difficulty in detection using two-dimensional analysis. 

Previous studies have shown a cytogenetic link between 
high-grade PIN and prostate cancer strengthening its role as a 
precursor lesion (36). In addition, >50% of patients with high- 
grade PIN present with cancer detected in a subsequent biopsy 
(37). Therefore, the identification of proteins specifically asso- 
ciated with PIN would have tremendous impact as markers for 
the early detection of prostate cancer. In our study, PIN and 
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PCA cell lysates exhibited similar protein profiles underscoring 
the phenotypic similarity of these two disease states. Three 
peaks at 4036, 4361 , and 4749 Da showed increased abundance 
in PIN and PCA lesions when compared with matched benign 
cell profiles. Two other peaks at 1 1 ,744 and 28,442 Da (free 
PSA) were decreased in BPH, PIN, and PCA cell extracts, and 
one marker at 14,696 Da was overexpressed in 56% of the PIN 
samples and only 29% of matched PCA samples. Interestingly, 
a few of these peaks, based on closely matched molecular 
masses, were found to be present in serum and seminal plasma 
from two of the patients donating tissue for this study (data not 
shown). Because these body fluid profiles were generated using 
the IMAC surface pretreated with CuS0 4 , they may represent 
the same proteins. However, additional samples will need to be 
examined to confirm this result. If the markers discovered in the 
tumor cell lysates can be detected in serum or seminal plasma, 
they may aid in the early detection/diagnosis of prostate cancer. 
The identification of these peptides or proteins and their use as 
possible markers of early detection is currently under investi- 
gation in our laboratory. 

In this study, most of the markers found in the PCA profiles 
were also present in the PIN profiles, and thus, the ability to 
discriminate between these two cell types was difficult. How- 
ever, better discrimination could be achieved between the be- 
nign cell types (normal, BPH) and the diseased cell types (PIN 
or PCA combined). Because it is well established that multiple 
foci of PIN and PCA arise independently within the same 
prostate and prominent genetic heterogeneity is a common fea- 
ture of prostate disease, a panel of biomarkers is the most likely 
solution to improvements in early detection and diagnosis. In 
maximizing the use of our approach, we explored a combination 
of biomarkers with the most significant differential expression. 
Combining markers 4361 and 4749 Da improved the sensitivity 
to 100% for the detection of PIN and PCA while maintaining 
87% specificity. When we incorporated the seven most differ- 
entially expressed proteins in a logistic regression analysis, a 
predictive equation resulted in 93.3% sensitivity and 93.8% 
specificity for PIN or PCA. One of the seven peaks (4,639 Da) 
and an additional peak (24,184 Da) were found only in PCA cell 
lysates. Each of these markers was expressed in 43% of tumor 
samples profiled. Because the majority of our tissue samples 
were moderately differentiated cancer (combined Gleason 
scores of 6 or 7), no correlation could be made to Gleason grade 
of tumor with regard to the expression of the peaks we found to 
be differentially expressed in PCA. Future studies involving the 
protein profiles of poorly differentiated and metastatic prostate 
cancer samples would determine whether any of the selected 
biomarkers represent markers of metastatic potential. Identify- 
ing if a patient has a clinically significant or insignificant cancer 
could feasibly be determined by an antibody array analysis of 
the patient's biopsy (i.e., lysate) using antibodies to selected 
biomarkers. 

Differences were observed in the expression of a 
28,400-Da peak, which is consistent with intracellular-free PSA. 
The molecular mass identified in our study closely matches the 
observed molecular mass of free PSA (28,430 Da), determined 
using ion spray MS (29). Likewise, this peak was absent from 
matched adjacent stroma cell lysates, indicating its specific 
expression in epithelial cells. Furthermore, data obtained from 



our immunoassay studies, also performed using the SELDI 
platform (16), have identified this peak from LCM cell lysates 
as PSA based on immunoaffinity (data not shown). In this study, 
normal epithelia of the prostate expressed large amounts of 
PSA, whereas the diseased cell types (PIN and PCA) had 
reduced expression levels. Normal epithelia microdissected di- 
rectly adjacent to the tumor foci also expressed high levels of 
PSA. This decrease in intracellular PSA in PCA cells is in 
agreement with other studies. For example, Jung el al. (38) 
found tissue PSA levels lower in cancerous than in normal tissue 
from the same prostate gland, and a study by Weir et al. (39) 
found immunohistochemical staining intensity of PSA inversely 
correlated with histological grade of tumor. Furthermore, sig- 
nificant inverse correlations have been found between tissue 
PSA expression levels and serum PSA values (40). Interest- 
ingly, a recent report by Pawletz et al. (15) also found a 28-kDa 
protein peak (not identified as PSA) via SELDI to be down- 
regulated in microdissected PCA cells when compared with 
matched normal epithelia. Our results are consistent with the 
hypothesis that the increase in serum PSA in men with prostate 
cancer is not because of increased production of PSA by the 
tumor cells but rather an increased leakage of PSA from the 
tumor tissue into the circulation as a result of a breakdown of 
tissue architecture. 

The BPH protein profiles also displayed some notable 
differences. There was an increase in abundance of peaks at 
3448, 4413, and 5666 Da in the BPH lesions. Of special interest 
was the 5666-Da peak, found to be overexpressed in 86% of the 
BPH cell lysates with a specificity of 88%. Only 22% of the PIN 
lesions and none of the PCA lesions overexpressed this marker. 
Efforts are under way to characterize this protein. A biomarker 
indicative of BPH alone may, if secreted into serum or seminal 
plasma, be useful in the reduction of biopsies in patients with 
elevated PSA. 

In conclusion, differential SELDI protein profiles were 
observed for cell lysates prepared from microdissected normal, 
BPH, PIN, and PCA epithelial cells. Several small molecular 
mass species were found to be overexpressed in PIN, and 
because they were also overexpressed in PCA, these proteins 
may represent early signals or signatures of a developing cancer. 
Additionally, one marker at 5666 Da was found to be increased 
in BPH and may have the ability to distinguish BPH from PCA. 
A combination of markers was effective in distinguishing 
normal/BPH from PIN/PCA with a sensitivity and specificity of 
93%. Additional studies are under way to identify and charac- 
terize these potential peptide/protein biomarkers using liquid 
chromatography tandem MS. Once identified, characterization 
of their function and biological role in prostate oncogenesis may 
lead to their potential use as diagnostic and prognostic biomar- 
kers as well as conceivable therapeutic targets. 
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